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(54) SPEECH ENCODING METHOD 
(57)Abstract: 

PURPOSE: To provide the speech encoding method 
which can improve speech quality while suppressing an 
increase in the transmitted information quantity. 
CONSTITUTION: A pitch period analyzing means 1 
extracts the pitch period 8 of an input speech 7 by 
speech frames and a main pulse train generating means 
3 generates a main pulse train 9 corresponding to the 
periods. A subpulse generating means generates a 
pulse sound source vector 10 by adding a subpulse to a 
main pulse by subframes of the speech frames. A 
synthesized speech 1 1 is generated with the pulse 
sound source vector 10 and a linear prediction 

coefficient 12 calculated by a linear predictive analyzing | tte - II 

means, and the error obtained by comparing the 

synthesized speech 1 1 with the input speech 7 is fed by an error minimizing means back to 
the main pulse generating means 3 and subpulse generating means 4 to adds subpulses so 
that the error becomes minimum. The main pulses are set by speech frames and the 
subpulses are set by the subframes and added to the main pulses. 
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* NOTICES * 

Japan Patent Office is not responsible for any 
damages caused by the use of this translation. 

1 .This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2.**** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[DrawingJJ The important section block diagram showing the configuration of the voice to digital 
converter which realizes the voice coding approach of this invention 

JPiawillg.2J The mimetic diagram showing the configuration of the voice frame which generates the 
Maine pulse in the voice coding approach of this invention, and the subframe which stands a subpulse 
[Drawing 3] The spectrum Fig. showing the relation of the Maine pulse and subpulse in the voice coding 
approach of this invention 

[Drawing 4] The important section block diagram showing the configuration of the voice to digital 
converter which realizes the conventional voice coding approach 
[Description of Notations] 

1 Pitch Period Analysis Means 

2 Linear-Predictive-Coding Means 

3 Maine Pulse Train Generation Means 

4 SubPulse Generation Means 

5 Linear Prediction Composition Means 

6 Error Minimization Means 

7 Input Voice 

8 Pitch Period 

9 Maine Pulse Train 

10 Pulse Sound-Source Vector 

1 1 Synthesized Speech 

1 2 Linear Predictor Coefficients 
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DRAWINGS 



[Drawing 1] 
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[Drawing 4] 
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DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Industrial Application] This invention relates to the approach of raising voice quality in the voice 
coding approach of a low bit rate by using the pulse sound-source vector which consists of a subpulse 
stood accommodative before and after the Maine pulse train of a fixed period, and the Maine pulse. 
[0002] 

[Description of the Prior Art] Although efficient digital coding of a low bit rate is used for audio are 
recording and audio transmission in recent years, the voice quality is a technical problem. The approach 
of dividing input voice into a voice frame, performing linear predictive coding to each voice frame by 
the coding approach of a low bit rate, and encoding the obtained linear predictor coefficients and 
prediction residue is in use. Although this prediction residue is encoded by the sound-source vector 
stored in the sound-source sign book, the method of standing 1 or several pulses is in pitch period 
spacing of input voice as one of the approaches of generating a sound-source vector. 
[0003] Hereafter, the generation means of the conventional pulse sound-source vector is explained. 
Drawhlg„4 is the important section block diagram showing the configuration of the voice to digital 
converter which realizes the conventional voice coding approach, and is **. A pitch period analysis 
means for 401 to analyze the pitch period 407 in the voice frame which divided the input voice 9 into 
predetermined time amount length, and to extract in drawing, A linear-predictive-coding means for 402 
to calculate said voice frame by linear prediction, and to output linear predictor coefficients 410, and 
403 input said pitch period 407. A pulse train generation means to generate the pulse train 408 of the 
period, a linear prediction composition means by which 404 generates synthesized speech 409 based on 
a pulse train 408 and linear predictor coefficients 410, 405 is an error minimization means to feed back 
to the pulse train generation means 403 so that an error [ the input voice 406 / synthesized speech / 409 ] 
may be minimized. 

[0004] Processing actuation of the sound-source vector generation means of the above-mentioned 
configuration is explained. The pitch period analysis means 401 extracts the pitch period 407 from input 
voice. The pulse train generation means 403 generates the pulse train 408 stood at intervals of the pitch 
period 407 which detected 1 or several pulses of the configuration defined beforehand. The linear 
prediction composition means 404 generates synthesized speech 409 by making a pulse train 408 into a 
pulse sound source using the linear predictor coefficients 410 computed by the linear-predictive-coding 
means 402. The error minimization means 405 compares the input voice 406 with synthesized speech 
409, and it applies feedback to the pulse train generation means 403 so that the error may become min. 
The optimal pulse sound-source vector is generated by the above process. Thus, using the generated 
pulse sound-source vector as a sound source has effectiveness in improvement in voice quality, such as a 
standup part of a voiced sound. 
[0005] 

[Problem(s) to be Solved by the Invention] In such a conventional sound-source vector generation 
method, since a pulse sound-source vector consisted of 1 or several pulses of a predetermined 
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configuration, only the pulse sound-source vector which has the characteristic period detected per voice 

frame was reproduced and a finer pulse sound source was not reproduced, there was a problem that 

sufficient pulse sound-source vector to approximate prediction residue was not acquired. 

[0006] This invention solves the above-mentioned technical problem, the prediction remainder can be 

approximated the optimal, the high pulse sound-source vector of power of expression can be generated, 

and it aims at offering the voice coding approach which can improve voice quality. 

[0007] 

[Means for Solving the Problem] The Maine pulse train corresponding to [ this invention carries out / 
voice / input / pitch period analysis, in order to attain the above-mentioned purpose, and ] the pitch, 
Stand 1 thru/or two or more subpulses accommodative within the predetermined section before and 
behind each Maine pulse of said Maine pulse train, generate a pulse sound-source vector, and it 
considers as a sound source. And it is the voice coding approach which sets the period of said Maine 
pulse train constant within a voice frame, stands said subpulse to said each Maine pulse in the subframe 
unit which divided said voice frame, and was encoded. 
[0008] 

[Function] In the configuration of the above [ this invention ], the pulse sound-source vector which 
added the subpulse to the Maine pulse has high power of expression, and the increment in the 
transmission amount of information by the information on a subpulse establishing a subpulse by 
carrying out to every subframe by raising tone quality and using information on the Maine pulse train as 
every voice frame is controlled. 
[0009] 

[Example] Hereafter, it explains, referring to a drawing about one example of the voice coding approach 
of this invention. Drawing 1 is the important section block diagram showing the configuration of the 
voice to digital converter which realizes the voice coding approach of this example. A pitch period 
analysis means for 1 to analyze the input voice 7 and to extract the pitch period 8 in drawing, A linear- 
predictive-coding means for 2 to carry out [ voice / input ] linear predictive coding, and to calculate and 
output linear predictor coefficients 12, A Maine pulse train generation means for 3 to generate the Maine 
pulse train 9 based on the pitch period 8, and to output, A subpulse generation means for 4 to carry out 
generation addition of the subpulse to the Maine pulse train 9, and to generate and output the pulse 
sound-source vector 10, A linear prediction composition means for 5 to generate synthesized speech 1 1 
by the pulse sound-source vector 10 and linear predictor coefficients 12, and to output, 6 is an error 
minimization means to feed back to the Maine pulse generation means 3 and the subpulse generation 
means 4 so that an error [ the input voice 7 / synthesized speech /ll] may be made into min. Moreover, 
dmwing.3 is the mimetic diagram showing the relation between the Maine pulse and a subpulse, and, as 
for the Maine pulse and 302, 301 is [ a subpulse and 303 ] the subpulse retrieval sections in drawing. 
[0010] The actuation is explained in the above-mentioned configuration. The pitch period analysis 
means 1 extracts the pitch period 8 in the voice frame of the predetermined time length which divided 
the input voice 7. The Maine pulse train generation means 3 generates the Maine pulse train 9 which 
makes the pitch period 8 pulse separation using a sign book etc., and the subpulse generation means 4 
stands a subpulse accommodative to the Maine pulse train 9, and it generates the pulse sound-source 
vector 10. The linear prediction composition means 5 generates synthesized speech 1 1 from the linear 
predictor coefficients 12 and the pulse sound-source vector 10 which were computed by the linear- 
predictive-coding means 2. The error minimization means 6 compares the input voice 7 with synthesized 
speech 1 1, and it applies feedback to the Maine pulse train generation means 3 and the subpulse 
generation means 4 so that the error may become min. The optimal pulse sound-source vector 10 is 
generated by the above process. 

[001 1] How to stand a subpulse accommodative in the above-mentioned actuation is explained. Drawing 
3 is the mimetic diagram showing the relation between the Maine pulse 301 and the subpulse 302. In 
drawing, to the Maine pulse 301, the fixed range of a certain before and behind time is appointed at the 
subpulse retrieval section 303, and the subpulse 302 is stood to the location which makes the error of 
synthesized speech 1 1 min within the subpulse retrieval section 303. The sign which shows the distance 
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from the Maine pulse 301, the sign which shows relative magnitude with the Maine pulse 301, and the 
sign which shows that positive/negative can express this subpulse 302. 

[0012] Below, how to encode the pulse sound-source vector 10 is explained, the input voice 7 was 
shown in drawing 2 as -- a, b, and c — it divides into the voice frame shown by and the subframe 
shown by 1,2, 3, and 4. Although generation of the pulse sound-source vector 10 is performed for every 
subframe, in order to reduce amount of information, the pulse separation and the pulse position of the 
Maine pulse train 9 are determined for said every voice frame. That is, the Maine pulse train 9 can be 
expressed only with the location of a head pulse, and the gain within each subframe by performing pitch 
period analysis once to each voice frame, and seting constant the pulse separation in the same voice 
frame. When processing which stands a subpulse is performed for every subframe on the other hand and 
two or more Maine pulses are in a subframe, the same subpulse as all the Maine pulses in the same 
subframe shall be stood. The pulse sound-source vector 10 is encoded by the above approach by the 
relative magnitude to the location of the gain of the location of the head Maine pulse in a voice frame, 
spacing of the Maine pulse, and the Maine pulse in a subframe, and the subpulse in a subframe, and the 
Maine pulse of a subpulse, and the sign which shows the positive/negative of a subpulse. 
[0013] By the above-mentioned configuration, since the configuration of a pulse train changes 
accommodative, the generated pulse sound-source vector 10 has power of expression higher than the 
pulse sound-source vector by the pulse train of the configuration to which the former was fixed, and its 
quality of synthesized speech improves. 

[0014] In addition, although this example explained the case where one subpulse was stood to each 
Maine pulse, it cannot be overemphasized that two or more subpulses may be established. 
[0015] 

[Effect of the Invention] The Maine pulse train corresponding to [ this invention carries out / voice / 
input / pitch period analysis so that clearly from the above explanation, and ] the pitch, Stand 1 thru/or 
two or more subpulses accommodative within the predetermined section before and behind each Maine 
pulse of said Maine pulse train, generate a pulse sound-source vector, and it considers as a sound source. 
And said subpulse can raise voice quality by standing to said each Maine pulse and having made it 
encode in the subframe unit which divided said voice frame by seting the period of said Maine pulse 
train constant within a voice frame, controlling the increment in amount of information. 

[Translation done] 



http ://www4 . ipdl j po. go.j p/cgi-bin/tran_web_cgi_ejj e 



4/13/04 



Page 1 of 1 



* NOTICES * 

Japan Patent Office is not responsible for any 
damages caused by the use of this translation. 

l.This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2 **** shows the word which can not be translated. 
3. In the drawings, any words are not translated. 



CLAIMS 



[Claim(s)] 

[Claim 1] The Maine pulse train corresponding to [ carry out / voice / input / pitch period analysis, and ] 
the pitch, Stand 1 thru/or two or more subpulses accommodative within the predetermined section 
before and behind each Maine pulse of said Maine pulse train, generate a pulse sound-source vector, and 
it considers as a sound source. And it is the voice coding approach which set the period of said Maine 
pulse train constant within a voice frame, stands said subpulse to said each Maine pulse in the subframe 
unit which divided said voice frame, and was encoded. 
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